Add Copeland pairwise scoring as alternative recommendation method by that-github-user · Pull Request #104 · that-github-user/thinktank

that-github-user · 2026-03-28T21:06:41Z

Summary

Copeland scoring: pairwise comparison on tests, convergence, files changed
--scoring copeland flag (default: weighted)
Per-criterion win/loss breakdown (testsWins, convergenceWins, filesChangedWins)
Display shows Copeland table when enabled
8 new tests including: all identical, non-transitive, single agent, dominance
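
The pairwise tallying summarized above can be sketched roughly as follows. This is a minimal illustration, not the merged implementation: the `AgentResult` shape, the `compare` helper, and the choice to sum the three per-criterion tallies into `total` are all assumptions; only the `CopelandScore` name and its `testsWins`, `convergenceWins`, and `filesChangedWins` fields come from the PR text.

```typescript
// Hypothetical input shape; field names are illustrative, not taken from the PR.
interface AgentResult {
  id: number;
  testsPassed: number;     // more is better
  convergenceSize: number; // size of the agent's convergence group; larger is better
  filesChanged: number;    // fewer is better
}

// Field names follow the PR summary; `total` is an assumed aggregate.
interface CopelandScore {
  id: number;
  testsWins: number;
  convergenceWins: number;
  filesChangedWins: number;
  total: number;
}

// +1 if a beats b on this criterion, -1 if b beats a, 0 on a tie.
function compare(a: number, b: number, lowerIsBetter = false): number {
  return lowerIsBetter ? Math.sign(b - a) : Math.sign(a - b);
}

function copelandScores(agents: AgentResult[]): CopelandScore[] {
  return agents.map((a) => {
    const score: CopelandScore = {
      id: a.id,
      testsWins: 0,
      convergenceWins: 0,
      filesChangedWins: 0,
      total: 0,
    };
    for (const b of agents) {
      if (b.id === a.id) continue; // an agent is never compared with itself
      score.testsWins += compare(a.testsPassed, b.testsPassed);
      score.convergenceWins += compare(a.convergenceSize, b.convergenceSize);
      score.filesChangedWins += compare(a.filesChanged, b.filesChanged, true);
    }
    // Assumption: the overall score is the sum of the per-criterion net tallies.
    score.total = score.testsWins + score.convergenceWins + score.filesChangedWins;
    return score;
  });
}

const scores = copelandScores([
  { id: 1, testsPassed: 10, convergenceSize: 2, filesChanged: 3 },
  { id: 2, testsPassed: 10, convergenceSize: 1, filesChanged: 5 },
  { id: 3, testsPassed: 8, convergenceSize: 2, filesChanged: 3 },
]);
console.log(scores.map((s) => [s.id, s.total]));
```

Because every pairwise result is antisymmetric (one agent's +1 is the other's -1), the totals across all agents always sum to zero, which makes a convenient sanity check.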

Agent selection: Agent #5 chosen over #3 via manual review (not thinktank scoring — to avoid circular bias in redesigning the scoring system). Agent #5 had more comprehensive tests (21 vs 19) and per-criterion score breakdowns.

Change type

New feature

Related issue

Closes #103

How to test

npm test  # 126 tests pass
thinktank run "task" --scoring copeland -n 3
# Shows Copeland Pairwise Scoring table with +/- per criterion
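
For readers curious how a flag like `--scoring` might be validated, here is a small sketch. The `parseScoring` helper and `ScoringMethod` type are hypothetical and do not reflect thinktank's actual argument handling; the only details taken from the PR are the two accepted values and the `weighted` default.

```typescript
// Hypothetical type: the two values named in the PR description.
type ScoringMethod = "weighted" | "copeland";

// Hypothetical helper: reads `--scoring <method>` from an argv-style array.
function parseScoring(argv: string[]): ScoringMethod {
  const i = argv.indexOf("--scoring");
  if (i === -1) return "weighted"; // default per the PR summary
  const value = argv[i + 1];
  if (value === "weighted" || value === "copeland") return value;
  throw new Error("--scoring must be one of: weighted, copeland");
}

const chosen = parseScoring(["run", "task", "--scoring", "copeland", "-n", "3"]);
const fallback = parseScoring(["run", "task", "-n", "3"]);
console.log(chosen, fallback);
```

Rejecting unknown values outright, rather than silently falling back to the default, keeps a typo from quietly changing which recommendation method is used.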

Breaking changes

This PR introduces breaking changes

🤖 Generated with thinktank (Opus), agent manually selected

Implement social choice theory-based scoring: agents are compared pairwise on tests, convergence, and files changed, and per-criterion wins are tracked. The --scoring copeland flag enables it alongside the existing weighted method.

Agent #5 chosen over #3 via MANUAL review (not thinktank scoring) for its better edge case tests (all-identical, non-transitive, single agent) and per-criterion breakdown in the CopelandScore type.

Closes #103

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

that-github-user · 2026-03-28T21:08:12Z

Self-review (manual agent selection):

Agent #5 chosen over #3 via manual code review, NOT thinktank scoring (avoiding circular bias)
Reason: Agent #5 has 21 tests vs 19, covers all-identical/non-transitive/single-agent edge cases
Copeland implementation: correct pairwise +1/-1/0 tallying
Criteria: tests passed, convergence group size, files changed (fewer = better)
Per-criterion breakdown visible in display (testsWins, convergenceWins, filesChangedWins)
--scoring flag with weighted (default) and copeland options
126 tests pass, CI green
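
The edge cases named in this review (all identical, non-transitive, single agent, dominance) can be illustrated with a compact sketch. It assumes a hypothetical `[testsPassed, convergenceGroupSize, filesChanged]` tuple per agent and a total formed by summing the +1/-1/0 results over every criterion and opponent; the merged code may aggregate differently.

```typescript
// Hypothetical layout: [testsPassed, convergenceGroupSize, filesChanged].
type Metrics = [number, number, number];

// Net pairwise tally per agent: +1 per criterion won, -1 per criterion lost, 0 on ties.
function copelandTotals(agents: Metrics[]): number[] {
  return agents.map((a, i) =>
    agents.reduce((sum, b, j) => {
      if (i === j) return sum; // skip self-comparison
      return (
        sum +
        Math.sign(a[0] - b[0]) + // tests passed: more is better
        Math.sign(a[1] - b[1]) + // convergence group size: larger is better
        Math.sign(b[2] - a[2])   // files changed: fewer is better
      );
    }, 0)
  );
}

// All identical: every comparison ties, so every total is 0.
const allIdentical = copelandTotals([[5, 2, 3], [5, 2, 3], [5, 2, 3]]);

// Single agent: no opponents, so the total is 0.
const singleAgent = copelandTotals([[5, 2, 3]]);

// Non-transitive: A beats B, B beats C, and C beats A on two of three criteria each,
// so the head-to-head results form a cycle and all three agents tie at 0.
const cycle = copelandTotals([[3, 1, 2], [2, 3, 3], [1, 2, 1]]);

// Dominance: the first agent wins every criterion against both others.
const dominance = copelandTotals([[10, 3, 1], [5, 2, 2], [4, 1, 3]]);

console.log(allIdentical, singleAgent, cycle, dominance);
```

The non-transitive case is the interesting one: no agent beats all others head to head, and a Copeland-style tally reports a three-way tie rather than an arbitrary winner, so a tie-breaking rule has to be decided separately.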

that-github-user merged commit 102b0e6 into main Mar 28, 2026
4 checks passed

that-github-user deleted the issue-103-copeland-scoring branch March 28, 2026 21:08

that-github-user mentioned this pull request Mar 28, 2026

Research: advanced recommendation methods (Copeland, Borda, pairwise comparison) #103

Closed

4 tasks

that-github-user merged 1 commit into main from issue-103-copeland-scoring